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Abstract. We investigate the statistics of the cosmic microwave background using the 
Kolmogorov-Smirnov test. We show that, when we correctly de-correlate the data, the parti- 
tion function of the Kolmogorov stochasticity parameter is compatible with the Kolmogorov 
distribution and, contrary to previous claims, the CMB data are compatible with Gaus- 
sian fluctuations with the correlation function given by standard ACDM. We then use the 
Kolmogorov-Smirnov test to derive upper bounds on residual point source power in the CMB, 
and indicate the promise of this statistics for further datasets, especially Planck, to search 
for deviations from Gaussianity and for detecting point sources and Galactic foregrounds. 
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1 Introduction 



The cosmic microwave background (CMB) is the most precise, best understood and therefore 
also most precious dataset in cosmology. Within the accuracy of present observations, the 
CMB exhibits a perfect blackbody spectrum of radiation at the temperature Tq = (2.72548 =h 
0.00057)K [1] with small, Gaussian fluctuations and a tiny amount of polarisation. The 
anisotropics and the polarisation can to a large extent be calculated within linear perturbation 
theory. Therefore, if the initial conditions are Gaussian as predicted from simple inflationary 
models, the temperature anisotropics and the polarisation should represent Gaussian random 
flelds in the sky. 

In this paper we outline a new method to test the Gaussianity of the CMB sky using the 
well-known Kolmogorov-Smirnov test. We describe how to de-correlate Gaussian variables 
so that the Kolmogorov-Smirnov test can be applied. We also discuss in detail practical 
issues one has to deal with when applying the Kolmogorov-Smirnov test to CMB data. 
Such practical issues include flnite-size effects, numerical issues for highly correlated data, 
effects from determining the covariance matrix from the same dataset which is to be de- 
correlated, and correlations between different subsets of the CMB map for which we compute 
the Kolmogorov stochasticity parameter. Not taking into account these effects can lead 
to wrong conclusions about the result of the Kolmogorov-Smirnov test. Once all of these 
issues are taken into account properly, the failure of the Kolmogorov-Smirnov test would 
be an indication that the data is non-Gaussian and it can be used to quantify higher order 
correlations. 

When applying the Kolmogorov-Smirnov test to data from the Wilkinson Microwave 
Anisotropy Probe (WMAP), we flnd no indication of such a failure; the maps seem to be 
entirely compatible with Gaussian fluctuations whose correlation function is given by the 
standard ACDM model. Note, though, that since the signiflcance of the outcome of the 
Kolmogorov-Smirnov test is limited by the resolution of the CMB map, this result might 
change with future CMB data as e.g. from the Planck satellite. 

We further use the Kolmogorov-Smirnov-test to obtain upper bounds on residual point 
source power in the CMB. The results are not yet competitive to the WMAP constraints, but 
they are derived for every band separately, without assuming a given scaling of the point- 
source power with frequency as in the WMAP papers. There are also ways of improving 
these constrains, which we plan to explore in future work. 

In the next Section we introduce the Kolmogorov-Smirnov test, discuss its modiflcation 
for correlated variables, and study in detail different effects that can change the results of 
the test. In Section 3 we apply it to simulated and measured CMB data. In Section 4, we 
use the Kolmogorov-Smirnov test to obtain upper bounds on residual radio point sources in 
the CMB. In Section 5 we conclude and discuss future applications of the method outhned 
here. 

2 The Kolmogorov-Smirnov test 
2.1 Basics 

The Kolmogorov-Smirnov test is based on the following mathematical theorem [2]: 
Theorem: (Kolmogorov) 

Let {Xi, • • • , Xn} be n independent realizations of a real random variable X ordered such 



-2- 



that Xi < X2 < ■ ■ ■ < Xn and be 

F{x) = P{X <x} (P = probability) (2.1) 

the partition function of X. The empirical partition function is given by 

r if X < Xi 
Fn{x) <^ k/n if Xfc < X < Xk+i (2.2) 
[1 iiXn<x . 

The Kolmogorov stochasticity parameter is defined by 

Xn:=V^sup\Fn{x)-F{x)\. (2.3) 

X 

Its partition function, denoted by $(n, A), 

$(n,A) = P{A„<A}, (2.4) 

converges uniformly to the Kolmogorov distribution $x(A) given by $k(A) := for A < 
and 

$^(A) := ^ (-l)'=exp(-2fc2A2) for A > . (2.5) 

/c=— CO 

lim $(n,A) = $x(A) . (2.6) 

n— ^►CO 

The Kolmogorov distribution, is independent of F. 

This theorem holds whenever the partition function F is continuous. 

We would like to apply the theorem to the temperature fluctuations AT(n) in different 
directions n, but there are two conditions of the theorem which are not satisfied a priori: 

First, the temperature fluctuations, even if they are a realization of the same stochastic 
process at each point (this is a consequence of stochastic isotropy), are not independent at 
different points. The CMB sky is correlated, 

(AT(ni) AT(n2)) = C{9) ^ , cos = ni • ns . 

Secondly, we do not know the theoretical distribution F of CMB anisotropics. We 
assume it to be Gaussian with mean zero but the variance, C(0) is determined from the 
data. Hence we compare the data with an empirically obtained distribution F. Clearly, the 
Gaussian distribution F obtained by using the variance of the data will in general be closer 
to the empirical distribution Fn than statistically expected. 

In addition, there are several practical issues such as finite-size effects and numerical 
issues for highly correlated data, which can change the results of the Kolmogorov-Smirnov 
test significantly. We discuss all of these effects in more detail in Sections 2.2 and 2.3 and in 
the Appendix. 

In the literature the Kolmogorov-Smirnov test has been applied to the CMB without 
any discussion of the above-mentioned issues [3]. We believe that the conclusions drawn 
in these papers are not reliable because of that. Recently the first point and the issue of 
finite-size effects have been addressed in Ref. [4]. We shall comment on this in more detail 
below. 
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Figure 1. The partition function for as function of the correlation parameter /3 for n = 100. 
Curves from left to right correspond to /3 = —0.4, — 0.2, 0, 0.2, 0.4. The curve for /3 = (middle, 
dark blue) agrees with the Kolmogorov distribution $k(A) (dashed black line). We have used 10000 
realizations of the model to explore the partition function <l>(100. A) for each value of p. As we have 
argued above, for positive correlations the distribution function is shifted to the right (to higher values 
of An), while it is shifted to the left for negative correlations. 

2.2 Effects of correlated variables, finite-size, and empirical determination of 
parameters 

It is not difficult to guess what correlations will do to the Kolmogorov stochasticity parameter: 
positive correlations will favor the variables Xj to cluster. For a fixed number n they thus 
explore less of the space of values than uncorrelated variables and therefore Fn is expected 
to deviate more from the theoretical distribution F and the Kolmogorov parameter A^^ will 
in general be larger. If the correlations are negative the opposite effect is expected and the 
stochasticity parameter A^ is reduced. 

This result is illustrated in Fig. 1, where the partition function of A^ = y^sup^ \Fn{y) — 
F{y)\ is show for n = 100 for a model with nearest neighbor correlations, 



We draw independent random variables Xj from a Gaussian normal distribution with vanish- 
ing mean and variance one, A^(0, 1). The correlated variables Yj are obtained by following the 
opposite of the procedure outlined in Section 2.3 below on uncorrelated variables. Another, 
analytically solvable model is presented in Ref. [5]. Note that throughout this paper, we 
will denote correlated random variables by Yj, whereas Xj are understood to be independent 
variables drawn from A^(0, 1). 

To compare with the correlated results we also show the partition function $(n. A) for 
the uncorrelated variables X in the upper left panel of Fig. 2. This is useful in order to 
separate effects from correlations from those of finite size (i.e. small n). For n = 100, the 
finite-size effects are already very small (the curve is already very close to the Kolmogorov- 
curve). However, if we compute the mean of $k(A^) for n = 100 from m = 10000 samples, 
it is ^ 0.48 rather than 0.5, as we would expect for the limiting case of n ^ oc. This 

is only a difference of a few percent, however, this difference will become relevant for us later 
on. We study the effects of small n in much more detail in Appendix D. 




if \i-j\ > 1 
/3 if |z-j| = l 

1 if i = j . 



(2.7) 
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Figure 2. Upper left: Partition function $(n, A) for n independent random varibales X for n = 10, 
100, and 1000. Superimposed is the Kolmogorov function 

Upper right: <l>(n. A) for n independent random variables if the partition function F is taken to be 
F = A^(a, b) where the mean a and variance b are determined from the data. The number of Xi used 
to determine a and b is twice the one used to compute X^^ n' = 2n. 
Bottom: Same as in the middle panel, but with n' = n. 

Intuitively it is also clear what happens if we determine the parameters of the theoretical 
distribution function from the data which we also use to calculate F^: on average the distance 
A^ between F^ and F will be smaller and therefore also $k(A^) is smaller, see also [6]. In 
the upper right and bottom panels of Fig. 2, we show the partition function $(n, A) obtained 
if F is not taken to be A^(0, 1) but N{a^ h) where we determine the mean a and the variance 
h from the data, 



1=1 1=1 



(2. 



In this case, Fn is by construction too close to F, thus moving A) to the left. This 
effect is quite strong if we determine the parameters from the same set of Xi that we use to 
compute A^ (c.f. bottom panel of Fig. 2, where we used n' — n) and becomes less pronounced 
if we use additional Xi to compute the mean and the variance (upper right panel, n' — 2n). 

We suspect that such an effect is the reason why the authors of Refs [3] have obtained 
so small values for ^K{^n) especially for relatively small n, see also [4]. With increasing n 
the value $k(A^) calculated in Refs. [3] grows, approaching unity in two cold spots with 5° 
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radius. We suggest that this is due to the correlations of the temperature anisotropics in this 
region. In the next Section we shall show this quantitatively. 



2.3 De-correlating Gaussian variables 

Let us consider an n-dimensional Gaussian variable Y with vanishing mean but with corre- 
lations. Its probability density is then given by 



(2.9) 



where C is the symmetric, positive definite correlation matrix, 

{YiYj) = J (Ty yiyj fy (y) = dj . (2.10) 
We can diagonalize this matrix with some orthogonal matrix O so that 



(2.11) 







■ 




C = OMO^ M = 


■ 


. 
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If Y is Gaussian, the n variables Xi defined by 

Xi = -{O^Y)i (2.12) 

Mi 

are uncorrelated and distributed with the normal distribution A^(0, 1). On these variables 
the Kolmogorov-Smirnov test should apply, i.e., the partition function of the stochasticity 
parameter \n should converge to $x foi" large n. 

If the partition function A) for obtained by comparing the empirical distribu- 
tion of X with A^(0, 1) does not converge to the Kolmogorov function $x there are two 
possibilities. 



We have not used the correct correlation matrix C (which is typically determined by 
the cosmological parameters). 

The fluctuations are not Gaussian, i.e. there are also higher order correlations such that 
even though (XiXj) = 6ij, Wick's theorem is not satisfied for higher order correlators. 



In either case this would falsify the hypothesis that CMB fluctuations are well explained 
by Gaussian inflationary perturbations in a ACDM model with the cosmological parameters 
inferred in the literature. 

We must, however, note that even though we do determine the correlation function from 
the cosmological parameters and not directly from the data, these parameters are inferred 
using the same data and are therefore not independent of it. Also in this approach, we 
are therefore somewhat subject to the problem that we determine the distribution of our 
variables from the data itself. In Appendix B we show that for the case of the CMB we do 
not see this effect. The reason is that the ACDM model puts a strong prior on the form of 
the correlation function, leaving only six parameters to be determined from a large amount 
of available CMB data. 
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Note also that the de-correlation transformation, eq. (2.12), has its limitations for very 
strongly correlated random variables. In the limiting case of two perfectly correlated random 
variables, li = 12, the covariance matrix C has one zero eigenvalue, so that we cannot 
compute Xi for this eigenvalue. In such a case, one needs to directly work with the correlated 
variables y, and compare the partition function of their Kolmogorov parameter to simulated 
ones. This is what we call 'correlation method' in the following.^ In cases where we do 
not have perfectly correlated data, but nevertheless very strong correlations (e.g. between 
neighbouring pixels), we can still run into numerical problems with the transformation in 
eq. (2.12), since we divide by a very small number /x^. We investigate this effect for CMB 
maps with negligible noise in Appendix C: The empirical partition function of the Kolmogorov 
stochasticity parameter for the de-correlated variables can be significantly shifted to the right 
due to very strong correlations. 

3 Application to the CMB 

In this Section we apply the Kolmogorov-Smirnov test to simulated data and to the seven 
year CMB anisotropy data from WMAP. 

3.1 Simulated data 

We start by calculating the CMB correlation function using the best fit cosmological param- 
eters of WMAP, as they are given in [7], see Table 1. The last digit given is rounded and 



Us = 0.96 scalar spectral index 
Ar = 2.4 X 10~^ amplitude of the curvature 

perturbation spectrum 
r = tensor to scalar ratio 

h'^Qb = 0.0226 baryon density 
h'^Q^ = 0.111 cold dark matter density 
T = 0.088 optical depth to the 

last scattering surface 
Q\ = 0.73 cosmological constant 
— 0.00 curvature parameter 



Table 1. The (minimal) cosmological parameters from the WMAP seven year data [7]. 

is uncertain. More details about the error bars and the definition of these parameters can 
be found in Ref. [7]. In Fig. 3 we compare the CMB angular power spectrum obtained with 
these parameters with the WMAP seven year data. A definition of the CMB angular power 
spectrum can be found e.g. in Ref. [8] 

The correlation function for the temperature anisotropy of two directions ni and 112 in 
the sky can be calculated from the CMB power spectrum, see [8], 

^ 00 

(Ar(ni)Ar(n2)) = — ^^(2^ + l)QP,(ni • n^) , (3.1) 

^ i=2 

where denotes the Legendre polynomial of degree i. The correlation function obtained 
from the spectrum of Fig. 3 is shown on the right hand panel. 

^An analytical investigation of such models with perfect correlations can be found in [5]. 
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Figure 3. The C^'s (left) and the CMB correlation function (right) from the WMAP seven year best 
fit cosmological parameters given above and the WMAP data with error bars, {jaK)'^. The 'kink' in 
the correlation function at 6> = 1° corresponds to the acoustic peaks in the power spectrum. 



We now simulate datasets consisting of n CMB-'pixels': We draw n independent real- 
izations X(n^) from the normal distribution A^(0, 1), which we then correlate with the help 
of the correlation matrix 

Cij = CiOij) , (3.2) 

following the opposite of the procedure outlined in Section 2.3. Here, Oij is the angular 
separation of the points and uj. For the sake of simplicity, we assume all of our data-points 
to lie on a great arc of opening angle a.^ For a given number n of points, nearest neighbors 
then have an angular separation 8 — ajn and we have to include at least ^max > rm/a C^s 
in Eq. (3.1) in order to be able to resolve this scale. In practice, we use ^max = 2000 for the 
plots in this Section. ^'^ For these CMB datasets, we obtain the partition function $(n. A) of 
from 10000 simulated datasets of n 'pixels' each. 
On the left plot in Fig. 4 we show the partition functions $(n. A) of the temperature 
fluctuations for points on a great arc with fixed total opening angle a = 10° for different 
values of n. As the number of points, n, increases, the angular distance between nearest 
neighbors, 5 — a/n^ decreases and they are more and more correlated, see Fig. 3. As we have 
explained in Section 2, this means that the probability for higher values of A^^ becomes larger 
and hence the function $ is shifted to the right with respect to the Kolmogorov function 
$x which is also indicated as the dashed line in Fig. 4. Therefore, if in a fixed patch the 
number of points is increased, the typical value of A^ increases and with it $k(A^). At this 

^ Note that working with data lying on a great arc is especially useful when we consider the full circle: 
To transform the uncorrelated variables X to the correlated ones given by Yi = iii{OX)i where O and in 
are defined as in Eq. (2.11), we have to diagonalize the correlation matrix C. For large values of n this 
requires a significant numerical investment: Diagonalization of a symmetric n x n matrix determining both, 
the eigenvalues /xf and the orthogonal matrix O requires operations. To sample the variable Y and therefore 
An m times requires 0(m • n^) operations. If however the points are equally spaced on a circle such that 
the angular separation of ni and Un is equal to the resolution the matrix C becomes 'circulant' and its 
diagonalization corresponds to a Fourier transform. In this case the number of operations can be reduced to 
0{m • nlog(n)) which is of course a significant improvement. For more details see [5]. 

^For n = 250 and n = 500 and an opening angle of a = 10°, ^max = 2000 is not enough to completely 
resolve these scales; however, the result that a higher resolution of the map yields larger values of An is not 
affected by this. 

^In all of the following sections, we include the beam of the WMAP-experiment, which cuts off power at 
i - 500/700/900 for the Q/V/W-band, respectively 
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Figure 4. Left: The partition functions <l>(n,A) for n points of simulated data in a total opening 
angle of a = 10° are shown for n = 30 — 500. For higher n the points are closer and therefore more 
correlated. Hence with increasing n, <l>(n, A) deviates more from the Kolmogorov function <l>x(A). As 
explained in the text, positive correlations prefer larger values of A. 

Right: The partition functions <l>(n, A) for n points of simulated data at fixed resolution 5 = a/n = 20 
arc minutes. Until an opening angle of a = 70°, which corresponds to n = 210, the correlations 
increase with n and $(n,A) moves to the right, preferring larger values of A. For larger angular 
separation the anti-correlations present for angular separations 50° ^ ^ 100° come in and balance 
the correlations, such that <l>(n. A) is kept at roughly the same position up to an opening angle of 
a = 180°. For a = 360°, corresponding to n = 1080, the correlations have moved <l>(n. A) further to 
the right again. 

In both plots the Kolmogorov function is also indicated (dashed black line). 



level, this has nothing to do with non-Gaussian features of the CMB but is simply an effect 
of correlations. 

On the right plot in Fig. 4 we fix the resolution to 5 = 20 arc minutes and simply 
increase the number of points. In this case, the variation of $(n, A) with n is much less 
pronounced. Here it comes from the fact that the total opening angle a = n6 depends on 
n. For n < 210, which corresponds to a < 70°, the function $ is still moving to the right 
with increasing n, i.e., correlations are becoming stronger; while for 210 < n < 540, i.e. for 
70° < a < 180° the small anti-correlations which are present for angles 50° ^ < 100°, see 
Fig. 3, balance the correlations such that the curves $ stay at the same position, or even 
move slightly to the left. For n = 1080, corresponding to a = 360°, the correlations have 
moved $ further to the right again. Note that the partition functions for a = 10° and 20°, 
corresponding to n = 30 and n = 60 are still subject to effects of small n, which shifts their 
partition functions to the left. 

3.2 WMAP - local analysis 
3.2.1 General remarks 

In the previous Section we have studied the partition function $(n. A) for ring segments 
of a given opening angle of a simulated CMB map. Here we want to locally analyze the 
observed data. In the following we denote a A^^ computed locally for a given patch or ring 
segment consisting of n pixels by A|^^, as opposed to the global A^^, which we introduce in 
Section 3.3. A|^^ could turn out to be useful for detecting local non-Gaussianities such as 
Galactic foregrounds, see also [3, 4]. 
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We have two possibilities to apply the Kolmogorov-Smirnov test to the WMAP data: 
We can either de-correlate the pixels in a given patch or ring segment using Eq. (2.12) and 
compare the A^^, to the Kolmogorov distribution Or we can directly work with the 

(correlated) temperature values in the CMB map and compare to our simulated partition 
functions $(n,A) of a correlated signal plus noise. In the following, we will present both 
methods. For the 'correlation method', we will work with ring segments as we did for the 
simulations described in Section 3.1, because this facilitates the simulation of the partition 
function. For the 'de-correlation method', we will use patches because they are better local- 
ized than the ring segments, thus being more suitable for a local study of the CMB. However, 
in Appendix A, we will also perform the de-correlation method on ring segments, in order to 
compare the results to the ones obtained from the correlation method. 

In general, we consider the 'de-correlation method' to be preferable over the 'correlation 
method'. First of all, in this case there is no need to simulate the partition function of since 
it is the well-known Kolmogorov distribution. The de-correlation method is also generally 
more powerful than the correlation method, since it is less degenerate in the detection of 
the different effects that can cause a deviation from the Kolmogorov curve. Consider, for 
example, a dataset for which the empirical partition function of the A^'s lies to the right 
of the Kolmogorov curve $x(A). This could be due to correlations or to other effects such 
as non- Gaussianity or to the fact that we have used a wrong F(x) when computing the 
A^'s. In simulations of correlated data, we can probably mimic this effect by using a certain 
correlation function. However, if the shift of the curves is not due to correlations in the data, 
we would in general not expect to obtain the Kolmogorov distribution when de-correlating 
the data with any correlation function. Furthermore, with the de-correlation method it is 
straightforward to account for inhomogeneous noise. 

However, note that the correlation method is more stable in the case of very strongly 
correlated pixels. Pixels can be very strongly correlated if the noise in the CMB map is 
negligible. In that case, the de-correlation method can lead to highly biased results, as we 
show in Appendix C. If the eigenvalues of the covariance matrix are extremely close to zero, 
it might even not work at all. For WMAP, we do not encounter this problem, because we 
have enough noise power on small scales. However, for future CMB experiments, with less 
noise, this issue might become relevant. 

For both methods, we compare the results obtained from WMAP data to simulations 
of 100 CMB maps plus Gaussian white noise (mimicking the detector noise). For the de- 
correlation method, we use the correct (inhomogeneous) noise variance of the maps, whereas 
for the correlation method, the noise variance has been averaged over all pixels used in the 
analysis. In the correlation method, adding uncorrelated noise to the simulations of correlated 
data reduces the correlation of the data and thus shifts the partition function $(n. A) to the 
left as compared to simulations without noise. As for the de-correlation method, the inclusion 
of noise just slightly broadens the scatter of the curves without introducing a bias. 

We apply the Kolmogorov-Smirnov test to the foreground-cleaned seven year WMAP 
Q-band (40.7 GHz), V-band (61 GHz), and W-band (94 GHz) maps. 

3.2.2 Beam and detector noise 

When computing the correlation function from the CMB power spectrum, we need to take 
into account the beam transfer function of WMAP. This cuts off power at high and it 
significantly reduces the power even in the first peak, see e.g. [9]. Note that the correlation 
function shown in Fig. 3 is computed without taking into account the WMAP beam. 
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Figure 5. Left: Map of $x(Aj^^) for the V-band for the de-correlation method. The color scale goes 
from to 1. As expected, outside of the galaxy, this map is very uniform. 

Right: Map of Aj^^^ for the V-band for the correlation method. The color scale goes from to 8. In 
this map structure not due to the galaxy but due to correlations is clearly visible. 

Finally, we need to take into account the detector noise of WMAP. If we assume that 
the maps have been perfectly cleaned from Galactic foregrounds and point sources, the data 
(i.e. the pixels of a given ring segment) are given by 

Y, = AT, + n, , (3.3) 

where ATi and rii denote the CMB temperature and the detector noise in pixel i, respec- 
tively. If we ignore small non-Gaussianities coming e.g. from inflation, the data are Gaussian 
distributed, with the covariance matrix 

Cij = {YiYj) = {ATiATj) + Nij , (3.4) 

where {ATiATj) is the CMB correlation function shown in Fig. 3, and N is the diagonal 
noise covariance matrix 

Nij = {niTij) = cr| 6ij . (3.5) 

Here, a? = cro/^obs(0 is the detector noise variance in pixel z, which is inversely proportional 
to the number of observations of a given pixel, A^obs(0 ^• 

3.2.3 Results: de-correlation method 

For analyzing the WMAP data locally using the de-correlation method, we choose our patches 
to be the pixels of a Healpix-map with NSIDE=32, which gives us 256 pixels per patch. We 
discard all pixels masked by the WMAP extended mask, which masks also identified point 
sources outside our Galaxy, and we discard a patch if more than 10 per cent of the pixels in 
a patch are masked. This leaves us with m = 7592 patches. We de-correlate the data from 
every patch using Eq. (2.12), which should give us independent Gaussian random variables 
Xi ^ A^(0, 1). Then, we compute the local X^^^ for every patch. The obtained map of 
$x(A}^^) for the V-band (without applying the mask) is shown in Fig. 5. Our Galaxy is 
clearly visible, but outside the Galaxy it looks like white noise, as it should. The maps for 
the other bands look very similar. In the following, we apply the mask as explained above. 

In order to get a visual impression of the distribution of the A|^^ of the different patches, 
we show their empirical partition function together with the Kolmogorov-curve $x(A) in 

^fjo for every frequency band can be found at http:/ /lambda. gsfc.nasa.gov/product/map/dr4/skymap_info.cfm 
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Figure 6. De-correlation method: empirical partition function of Aj^^*^ for WMAP-7year-data and 
100 simulations for the Q-band {upper left)^ the V-band {upper right) ^ and the W-band {bottom left). 
We have m = 7592 patches and n = 256 pixels per patch if none of the pixels is masked. We mask 
pixels using the WMAP extended mask; a patch is discarded if more than ten per cent of its pixels 
are masked. The simulations are shown in green. 

Bottom right: histogram of for all bands. We see a small bias towards small values of A (larger 
<l>(n,A) at fixed A), refiecting the effects of small n. 



method 




Q V W 


local (patches) 




0.488 0.489 0.490 




stddev[$i^(Aj,°^)] 


0.290 0.293 0.292 


global 


mean[$x(An)] 


0.508 0.505 0.500 




stddev[$x(An)] 


0.291 0.284 0.284 



Table 2. Empirical mean and standard deviation of ^K{^n)- Top: local analysis (de-correlation 
method), n ^ 256, m = 7592. Bottom: global analysis, n = 2000, m = 500. The empirical mean 
should be Gaussian distributed around {^k) = 0.489 (0.495) with a standard deviation of a ^ 0.0033 
(0.013) for the local analysis (the global analysis). The empirical standard deviation should be about 
y/ {{^K — {^k))'^) = 1/a/12 ~ 0.29 for both. Both analyses are fully consistent with theoretical 
expectations. 

Fig. 6. Also shown are the analysis from 100 simulated CMB maps to which Gaussian 
white noise has been added. For the simulations we consider homogeneous noise, whereas we 
account for inhomogeneous noise when de-correlating the real data. All three bands of real 
data lie well within the expected variance. Note that the simulated curves are biased to the 
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left due to finite-size effects, as we show in Appendix D. 

If the theoretical partition function F{x) used to compute A|^^ is the correct one, the 
$k(AJ^^) are uniformly distributed on the interval [0,1]. Thus, 

{^k) = 0.5, 

Vii^K - {^k)?) = I/V12 ^ 0.29 , (3.6) 

where (•) denotes an ensemble average and we have used $x = ^xi^n^) foi" notational 
simplicity. Assuming that the different realizations of $x are statistically independent and 
using the central limit theorem, $x = ^ Ya^i ^K,i is then normally distributed around 0.5, 
with a standard deviation of 0.0033 for m = 7592. However, note that this is 

not quite correct for n — 256, due to finite-size effects, as we show in Appendix D. We show 
with simulations that, for n = 256, $x = ^ Y^=.\ ^K,i is distributed around 0.489 rather 
than around 0.5, i.e. it is biased low by about 3 a. In Fig. 6, we also plot the histogram of 
$x for all three bands. We do see the bias towards small $x from the finite-size effects. 
The values for the empirical mean and standard deviation of $x are summarized in the top 
panel of Table 2. The results are consistent with theoretical expectations. 

3.2.4 Results: correlation method 

Instead of de-correlating the data from the CMB map, we now directly compare the empir- 
ical partition function of A|^^ for the original correlated data with our simulated partition 
functions A) for correlated data. We show a map of Aj,°^ (nside=64) in Fig. 5. Except 
for the Galaxy, the structures that we see in this map simply reflect correlations in the CMB. 
This can be seen from the fact that these structures go away when we de-correlate the data 
(cf. left panel of Fig. 5), leaving us with white noise. The Galactic plane is visible in the 
de-correlated map as well, indicating that we are seeing radiation from the Galaxy. 

In order to compare the consistency of the results with simulations, we go back to 
working with ring segments. As compared to the simulations in Section 3.1, we modify our 
analysis for data that do not lie on a great arc, but on a circle in a plane parallel to the 
Galactic plane, characterized by an angle Oring in Galactic coordinates. With this we can 
directly use the data from the WMAP temperature maps in the HEALPix 'ring' format. 
This has the advantage that we do not need to rotate the map in order to obtain data on 
a great arc that lies well above the Galactic plane (rotating the maps introduces significant 
uncertainties in the data), while the simulation of the partition function is still easy to do. 

From these rings, we choose segments of an opening angle of 20° and a given gap 
angle between the segments, in order to reduce correlations between them (both angles are 
measured with respect to the centre of the coordinate system). We choose the gap between 
the ring segments and between different rings to be 1° each. In Appendix E, we verify with 
simulations that for such a gap we do not yet see any effects from correlations between the 
different ring segments.^ 

In order to obtain as many ring segments as possible, we use all rings between Oring = 
20° — 60° with a distance of 1°. We discard all pixels with point sources masked by the WMAP 
extended mask. If in a ring segment more than 10% of the pixels are masked, we discard 

^We expect correlations between ring segments to manifest themselves in too steep curves for $(n, A): 
Correlated ring segments 'know about each other', which should make the An for the different ring segments 
more similar to each other, and thus the cumulative distribution for the An steeper. Note that since the 
patches are much better localized than the ring segments, they should be much less affected by this effect. 
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Figure 7. Correlation method: empirical partition function of Aj^*^ for WMAP-7year-data for the 
Q-, V-, and W-band (upper left, upper right, and bottom panel, respectively). We compare these 
to simulated partition functions $(n,A), obtained from m = 10000 samples, (grey curve) and 100 
realizations of n = 143, m = 712 (green curves). The opening angle of a given ring segment is 20°, 
the gap between rings and ring segments are 1° each. Pixels containing identified point sources are 
masked, and we take into account only homogeneous noise. The Kolmogorov- curve ^k(A) is indicated 
by the black dashed line. 



the whole ring segment, thus obtaining m = 712 ring segments. Due to the pixelisation 
scheme, the number of pixels contained in a given opening angle depends on the ring. For 
an opening angle of 10°, the number of pixels we use varies between n = 132 and n = 150 
for Oring = 20° — 60°. Furthermore, the masking of the pixels changes the number of pixels 
in a given ring segment. The variation in n causes small differences between the simulated 
partition functions of the A^^ for the 'correlation method' introduced below. However, we 
have verified numerically that this effect is negligible compared to the variation we obtain 
due to the finite number of ring segments we can extract from the map. For the simulations, 
we use the average number of pixels in the ring segments, n = 143. 

We now consider only homogeneous noise. The effect of using the wrong noise variance 
in the correlation method actually introduces a bias, whereas in the de-correlation method 
it merely increases the scatter of the simulations. We have studied this effect by excluding 
the regions where the noise is most inhomogeneous, i.e. the regions around the ecliptic poles, 
from the analysis.^ This left us with m = 357 ring segments only, rather than m = 712 as 
in the de-correlation method, thus worsening the statistical power. We found that the bias 

^Around the ecliptic poles, the WMAP scanning strategy affects the structure of the noise most strongly. 
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band 


Q V W 


mean[$i^(Aj^'')] 


0.90 0.88 0.86 


stddev[$i^(Aj,°^)] 


0.19 0.22 0.25 



Table 3. Results for the three WMAP bands for the correlation method for m = 712 samples 
(n ^ 143): shown are the empirical mean and standard deviation of ^xi^n^) over all ring segments. 
Note that now we obtain larger means for <l>x(Aj^^), simply because the X^^^ are larger due to the 
correlations between the pixels in a given ring segment. The standard deviation is smaller than 0.29, 
due to the flatness of the Kolmogorov-curve for large values of A^. 

is not significantly removed by this and therefore included also the pole region in the final 
analysis. 

The results for the different bands are shown in Fig. 7. The curves obtained from 
the WMAP data are still marginallly consistent with the simulations, but they are now 
systematically too steep. This is most probably due to the fact that we only considered 
homogeneous noise in this analysis, while taking into account the correct noise covariance in 
the de-correlation method.^ 

In Table 3, we show the empirical mean and standard deviation of $x(A|^^), where $x 
is the Kolmogorov distribution, and not the correct partition function for the correlated case. 
As expected, we obtain > 0.5, simply because the A|^*^ are pushed towards higher 

values by the correlations within the ring segments. For the typical values of A^^ ^ 0.9 we 
obtain, the Kolmogorov-curve is already quite flat. Thus, $x(A|^^) varies little over the ring 
segments, leading to a standard deviation of $x which is smaller than 0.29. 

3.3 WMAP - global analysis 

In order to find out whether the CMB map as a whole is Gaussian distributed and described 
by the correlation function used, we compute a global A^ of the CMB map: We randomly 
draw n = 2000 pixels outside the mask from the map, de-correlate all of these pixels with 
their full covariance matrix (using inhomogeneous noise), and compute the global A^ from 
the resulting uncorrelated data. We compute m = 500 of these global A^. The values for n 
and m where chosen to maximize the sensitivity of A^ to point-source contamination, while 
still being safe from residual correlations between the different sets of pixels, as we explain 
in Section 4. 

In order to get a visual impression of the results, we show the empirical partition function 
of Xn, together with 20 simulated partition functions in Fig. 8. The WMAP-curves lie well 
within the range expected from simulations. 

Just as for the local analysis in Section 3.2.3, the $k(A^) have to be uniformly dis- 
tributed on [0,1], if all our assumptions used to compute A^ are correct. We plot the his- 
togram of $k(A^) for n = 2000 and m = 500 for all bands in the bottom right panel of 
Fig. 8. 

In order to obtain a more quantitative result, we again compare the empirical mean and 
standard deviation to the theoretical expectations, c.f. Section 3.2.3. Taking into account the 
very small bias from the effect of a finite n (c.f. Appendix D), we expect $x to be normally 
distributed around 0.495 with a standard deviation of a ^ 0.013. The empirical standard 
deviation of $x should be about 0.29. We summarize the empirical mean and standard 

^When considering only homogeneous noise in the de-correlation method, we also obtain curves which are 
less consistent with simulations than when taking into account the full noise covariance. 
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Figure 8. Global analysis: empirical partition function of A^*^ for WMAP-7year-data and 20 simula- 
tions for m = 500 sets of n = 2000 randomly drawn pixels. Upper left: Q-band, upper right: V-band, 
bottom left: W-band. 

Bottom right panel: Histogram of for the three bands. 



deviation of $x(A^) in Table 2. The results are again fully consistent with theoretical 
expectations. 



4 Constraints on residual point sources 

One can in principle use the Kolmogorov-Smirnov test to constrain the power coming from 
residual (non-resolved) point sources. We consider only radio point sources, which are spa- 
tially uncorrelated (white noise) and thus have a flat power spectrum. Toffolatti et al. [10] 
show that the residual radio point sources are well described by a Poisson distribution. In 
this work, we approximate this Poisson distribution by a Gaussian, which is a good approx- 
imation if the Poisson distribution has a large mean. We can then simply add a constant 
contribution Cf^ = const to the CMB power spectrum and re-run the Kolmogorov-Smirnov 
test. We plot different point source power spectra together with the CMB power spectrum 
for the WMAP seven year best fit cosmological parameters in Fig. 9. 

Using the spectral behaviour of the detected point sources, the frequency dependence 
of Cf^ can be modeled as a power law [11] 

Cf(,,) = A^sriuf(^^j , (4.1) 
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Figure 9. CMB power spectrum and 3 different constant power spectra, describing the residual point 
source background. The WMAP-constraints on the power spectrum of residual point sources is of the 
order of O.Ol/ii^^sr or better. 



where a = —0.09, uq = 40.7 GHz is the frequency of the Q-band, and r{u) converts antenna 
to thermodynamic temperature: 

r(z/) = — 2"^^ ' ^ = TTf^ — • (4-2) 

The WMAP seven year estimate for the amplitude is A^^ = (9.0 ±0.7) x 10"^/xK^sr [7]. The 
constraints on C^' on the different bands obtained from Eq. (4.1) given this result for Apg 
are listed in Table 4. 

In order to obtain constraints on the residual point sources from the Kolmogorov- 
Smirnov test, we add a constant point-source power spectrum to the CMB power spectrum 
and compute again the m = 500 values for the global with n — 2000, as in Section 3.2.3. 

As we have already stated above, the values for n and m where chosen to maximize the 
sensitivity of to point-source contamination, while still being safe from residual correlations 
between the different sets of pixels: We found that the sensitivity of A^ to a contamination 
of the CMB power spectrum by point sources is much higher for large n than for small n. 
This can be understood as follows: The effect of adding point-source power to the CMB- 
power spectrum is to change the partition function F[x) of our de-correlated pixels. This 
increases the value of sup^ \Fn{x) — F{x)\^ c.f. equation (2.3). Now, since for large n the 
original supremum (without point-source power added) is smaller than for small n, it feels 
the change of F{x) stronger (i.e. its relative change is larger than for small n). We thus 
chose to set n — 2000, for which we can still diagonalize the covariance matrix in a reasonable 
amount of time. In addition, for n — 2000 pixels, the effects from finite n are already very 
small, c.f. Appendix D. 

The number of pixels in our map is A^pix = 12nside^ ^ 3 x 10^ for nside = 512. With 
roughly 30% of the sky masked, we are left with ^ 2 x 10^ pixels outside the mask. Therefore, 
every set of n = 2000 pixels we draw represents about 0.1% of the pixels outside the mask. If 
we draw m — 500 such pixel-sets, we have in total used about 50% of the pixels outside the 
mask (neglecting the fact that some of those might have been drawn several times). We show 
in Appendix E that with this we are safe from effects from residual correlations between the 
different pixel sets. 
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Figure 10. Empirical mean of ^^(An) versus the Poisson power for the Q-band {upper left)^ 
V-band {upper right) ^ and W-band {bottom), for n=2000 and m=500. The confidence intervals around 
the expected value of 0.5 are also shown. 



band 


Q 


V 


W 


WMAP 7 yr 


(9.8 ±0.8)10-3 


(2.0 ±0.2)10-3 


(0.43 ±0.03)10-3 


Kol. (de-corr) 


< 0.05 


< 0.02 


< 0.05 



Table 4. Constraints on C^^ in /iK^si from the WMAP team and from our Kolmogorov analysis. 



As in Section 3.3, we again use the fact that the empirical mean $x has to be normally 
distributed around 0.5 with a standard deviation of a ^ 0.013 if we used the correct partition 
function F to compute A^.^ If we add too much point-source power, this hypothesis is no 
longer true, and $x can become very different from 0.5. We thus add point-source power 
C^^ of different strengths to the CMB power spectrum, and plot the resulting $x versus C^^ 
in Fig. 10. The la, 2(j, and 3a regions around the expected value of 0.5 are also shown. We 
obtain an upper bound on the point source power by rejecting the hypothesis if our measured 
value of $x lies outside the 3a region. The bounds we obtain for every band are listed in 
Table 4. The Q-band is affected least by the point sources, since it has the lowest resolution, 
and thus the beam window function cuts the higher multipoles, when the point sources are 
more important as compared to the CMB. 

Our results obtained in this way are about 0.5 — 1 order of magnitude worse than the 

^Note that we neglect the small bias from finite n here. This results in slightly more conservative constraints 
than with the bias taken into account, since $k is biased towards smaller values. 
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WMAP limits. However, note that the WMAP team has combined different bands and 
used an assumption about the scahng of Cf^ with frequency to obtain their constraints. We 
do not use any assumption on this and derive independent constraints for each band. We 
could most probably improve the constraints by using a higher n for computing A^^, thus 
enhancing the sensitivity of to residual point-source power as explained above. However, 
as already stated above, with n = 2000, we are already close to the limit of what we can 
do with the matrix diagonalization routine we are presently using. Therefore, working with 
a larger n would involve using more sophisticated routines and is beyond the scope of this 
work. Note also that our limits are quite conservative, given that they were obtained using 
only 50% of the pixels outside the mask, i.e. about 35% of the sky. We could also improve 
our constraints by using more than 500 samples, but we would run the risk of working with 
correlated samples (if we were to draw the additional samples outside our current mask), or 
of being contaminated by Galactic foregrounds (if using a smaller mask). We prefer to be 
safe from these effects and thus stick to the more conservative estimate. Finally, note that we 
have chosen to reject the hypothesis if the data lie outside the 3a region around the expected 
value, which is again quite conservative. 

5 Conclusion and outlook 

In this paper we have shown that the Kolmogorov stochasticity parameter of the correlated 
data Ar(n) does not obey a Kolmogorov distribution and we have explained why this is to be 
expected. We have also presented a method to de-correlate the data so that the Kolmogorov- 
Smirnov test on the de-correlated data should give an indication whether or not the CMB 
anisotropics are Gaussian. Alternatively, we can apply the Kolmogorov- Smirnov test directly 
to the correlated data, but with a modified partition function, which takes into account the 
correlations and which is obtained from simulations. This latter method is to be favoured 
over de-correlating the data in the case of very strongly correlated data. 

We have applied the Kolmogorov- Smirnov test on WMAP seven year data using both 
of these methods, and we have shown that the WMAP maps are fully consistent with a 
Gaussian dataset with the correlation function given by the best-fit cosmological parameters. 

We have then used the Kolmogorov-Smirnov test to obtain conservative constraints 
on the constant power spectrum of residual radio point sources. With the numerical effort 
involved and the present resolution of the maps, our results are not competitive with the 
standard analysis but we are convinced that this method has considerable potential. 

Given that the spherical harmonics coefficients of an isotropic sky map are un- 
correlated, they are probably a better choice of variables to constrain residual point source 
power in the map. However, this is beyond the scope of the present work, and presumably 
mostly useful for higher resolution maps as we expect to obtain from the Planck satellite 
currently taking data. These maps may make this method very competitive and certainly an 
interesting alternative to the standard analysis. 

The Kolmogorov-Smirnov test can also be used to detect Galactic foregrounds, such as 
synchrotron radiation or dust. This should of course be done locally in pixel space. 

One may also use the fact that the a^^n's should be independent Gaussian variables with 
variance Q to determine cosmological parameters. For a given model determined by a list 
A of cosmological parameters, e.g., A = (rig, A^^, Qch'^, fi^/i^, r) we can test whether the 
ratio of observed a^^'s and the theoretical C^s are independent with distribution iV(0,l). 
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More precisely, the quantities 



Re[aim] 



and 



lm[aim] 



should be distributed with A^(0, 1). One may define a Markov Chain Monte Carlo (MCMC) 
algorithm which minimizes the distance of the distribution of the resulting Kolmogorov 
stochasticity parameter with the Kolmogorov distribution. It is easy to obtain the best 
fit in this way, but to determine the errors will probably require some thought. A simple first 
guess is to use least square deviation of the functions ^{X^A) and This corresponds 

to assuming Gaussian errors and may not be so bad if one has sufficiently many points A, 
due to the central limit theorem. The width of the distribution, A$(A) can be determined by 
simulations using the experimental errors. Even if this method might turn out to be slower 
than the MCMC algorithms presently in use, it provides an independent alternative which 
always has its merits. 

Finally, one may use the normalized a^^'s to test for non-Gaussianity. If even for the 
best fit cosmological parameters, the distribution of the Kolmogorov stochasticity parameter 
deviates from the Kolmogorov distribution, this may indicate that the a^^n's are not Gaussian 
distributed. These possibilities will be explored in detail in a future project. 
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A De-correlation method applied to ring segments 

In order to easily compare the results of the de-correlation method to the ones obtained 
with the correlation method, we redo the de-correlation analysis on the ring segments used 
in Section 3.2.4 and show the results in Fig. 11 and in Table 5. Taking into account the 
finite-size effect, we expect the empirical mean of $x to be Gaussian distributed around 
{^k) = 0.484 with a standard deviation of a ^ 0.011 (cf. Appendix D). The results are fully 
consistent with theoretical expectations and, in contrast to the results from the correlation 
method, not biased. As already mentioned above, we think that the bias in the correlation 
method comes from the fact that we do not account for the correct (inhomogeneous) noise, 
but use homogeneous noise in the simulations. 

B Effect of the empirical determination of the data variance for a ACDM 



The power spectrum used to de-correlate the data and to perform the simulations of the 
correlated data, has been determined as the best-fit power spectrum for the ACDM cosmo- 
logical model from the CMB maps themselves [7]. Therefore, we expect in principle that the 

^°http:/ /lambda. gsfc.nasa.gov/product/map/current/m_products.cfm 



model 
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Figure 11. Decorrelation method: empirical partition function of A^*^ for WMAP-7year-data and 
100 simulations for the Q-band {upper left)^ the V-band {upper right), and the W-band {bottom left). 
The opening angle of a given ring segment is 20°, the gap between rings and ring segments are 1° 
each. Pixels containing identified point sources are masked. We have m = 712 ring segments and on 
average n = 143 pixels per ring segment. The simulations are shown in green. 
Bottom right: histogram of for all bands. We see a small bias towards small values of 
refiecting the effects of small n. 



method 




Q V W 


de-correlation (ring segments) 




0.496 0.497 0.483 






0.295 0.295 0.289 



Table 5. Empirical mean and standard deviation of ^K{^n) of the de-correlation method applied 
to the ring segments with n ^ 143, m = 712. The empirical mean should be Gaussian distributed 
around {^k) = 0.484 with a standard deviation of a ^ 0.011. The empirical standard deviation 
should be about y/ {{^k — {^k))'^) = 1/aA2 ~ 0.29. The analysis is fully consistent with theoretical 
expectations. 

Xn tend to be smaller as compared to the case when we work with the correct theoretical 
power spectrum, describing the Gaussian distribution function from which the data have 
been drawn. This should shift the partition function ^{Xn) to the left. This fact is known 
under the name Lilliefors effect and a test taking this into account is the so called Lilliefors 
test [6]. 

In this Appendix, we investigate this effect and show that it is negligible in our study. 
We start by simulating a Gaussian CMB map from the WMAP best fit power spectrum, 
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Figure 12. Empirical partition function of of simulated CMB data, de-correlated with the the- 
oretical (i.e. correct) Q's (red curve) and the C^s from the best-fit ACDM model to the simulated 
data (green curve). 



described by the parameters in Table 1. 

From this map, we extract the best-fit power spectrum for a ACDM model, using a 
simple likelihood in Ci and a simplex algorithm to find its maximum. The theoretical 
power spectra have been computed with cmbeasy [13]. 

We then de-correlate the simulated data once with the correct power spectrum and 
once with the best-fit power spectrum to the simulated data, and plot the empirical partition 
function of \n for both cases. The results are shown in Fig. 12. There is only a very small 
difference between the two cases, so that we can safely ignore this effect in our analysis. 

We assume that this is so because the derived distribution contains only six parameters 
(the cosmological parameters of the minimal ACDM model), which puts a strong prior on 
the correlation function, and because it has been derived using many more pixels than the 
ones used in the Kolmogorov-Smirnov test. 



C Numerical issues for very strongly correlated pixels 

Very strong correlations between pixels can lead to numerical instabilities in the de-correlation 
method, as we show in the following. This can be of relevance if the noise in a CMB map 
is so low that it cannot make up for the missing power on small scales, which is cut by 
the beam of the experiment. We study these effects with simulated CMB data without 
noise. We first look at a Healpix-map of nside = 512. The latter has very small pixels 
(0.013 deg^) as compared to the WMAP beam (0.048 deg^ for the W-band and 0.26 deg^ for 
the Q-band). Thus, neighbouring pixels are presumably strongly correlated, resulting in a 
covariance matrix with eigenvalues close to zero. We therefore expect to run into numerical 
problems as discussed in Section 2.3. Indeed, when we take into account the WMAP beam, 
we even obtain very small negative eigenvalues, which we think are due to numerical problems 
in the diagonalization routine. In this case, we cannot take the square-root of the eigenvalue 
and eq. (2.12) becomes ill-defined. 

Even if we do not take into account the beam, but we use all the power up to ^max = 
1400 for both the correlation function and the simulated maps, we do not get a meaningful 
result. The covariance matrix still has eigenvalues close to zero, which can lead to numerical 
instabilities in eq. (2.12). We show the effect of this in Fig. 13: Using neighbouring (i.e. 



- 22 - 





Figure 13. Numerical issues: Empirical partition function of using (very strongly correlated) 
neighbouring pixels (left), and using every third pixel only {right). When we use neighbouring pixels, 
the curves are biased to higher values of A^. 




Figure 14. Investigation of effects from small n: Left panel: 100 realizations of the empirical partition 
function of A^ for m = 712 independent ring segments of n = 143 pixels each. 

Right panel: Same for for m = 712 independent ring segments of n = 2000 pixels each. The curves 
for n = 143 lie mostly to the left of the Kolmogorov-function, whereas the ones for n = 2000 scatter 
more symmetrically around the latter. 



strongly correlated) pixels heavily biases the curves, whereas using only every third pixel 
yields unbiased curves. Not being aware of this effect can lead to wrong conclusions when 
interpreting the results of the Kolmogorov-Smirnov test. However, we have verified that this 
effect is no longer present when we add the detector noise of the WMAP experiment to the 
simulations. 

D Finite-size effects 

In Section 2.2, we have looked at the effects from small n on the partition function of A^, and 
we found that the the partition function for n ^ 100 is already very close to the Kolmogorov- 
function. In this Appendix, we study the effect from small n in more detail. 

In Fig. 14, we compare simulations of the empirical partition function of A^^ for n = 143 
with the one for n = 2000. In both plots, we used m = 712, which is what we used in 
the local analysis of WMAP data using ring segments, and we show 100 realizations of the 
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Figure 15. Effects from small n on the histogram of ^k- Upper left: n = 10, upper right: n = 143, 
bottom: n = 2000. For all histograms, we used m = 10000, and we show 5 realizations. For n ^ oo, 
we expect the histogram to become flat. 
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Figure 16. Histogram of computed from 1000 realizations. Left panel: n = 143, m = 712. The 
histogram is centered around 0.484, which means that is biased low by about 1.5 a. Right panel: 
n = 2000, m = 500. The histogram is centered around 0.495, the bias is less than 0.5 a. 



partition function. For n — 143, most of the empirical partition functions lie to the left of 
the Kolmogorov-curve. 

The same effect can be seen in Fig. 15, where we show a histogram of all $x for n = 10 
(upper left), n — 143 (upper right), and n — 2000 (bottom panel). Here, we have used 
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m = 10000 in order to minimize the intrinsic variation of the histogram, and we show 5 
reahzations. Without finite-size effects, these histograms should be flat, as we argued in 
Section 3.2.3. The finite-size effect causes a bias of $x towards smaller values. 

In order to quantify the resulting bias of $x foi" our local analysis in Section 3.2.3 
(n = 143), we compute $x for n = 143 and m = 712 for 1000 samples, and plot the resulting 
histogram of $x in the left panel of Fig. 16. We do the same thing for the global analysis in 
Section 3.3 for n = 2000 and m = 500, and plot the resulting histogram of $x in the right 
panel of Fig. 16. 

Without the finite-size effects, these should be Gaussian distributed around = 0.5 

with a standard deviation of a = l/\/l2m ^ 0.011 (0.013) for m = 712 (m = 500), as we 
have explained in Section 3.2.3. The histogram for n = 143 is centered around 0.484 rather 
than around 0.5, i.e. $x is biased low by about 1.5 a. The histogram for n = 2000 is centered 
around 0.495, i.e. the bias is less than 0.5 a. 

An analogous analysis yields that for n = 256, $x is distributed around 0.489. We used 
this in Section 3.2.3 when comparing the results of the de-correlation method with theoretical 
expectations. 

E Residual correlations between ring segments and global samples 

In Fig. 17, we study the effects of residual correlations between the ring segments and between 
the global samples. One would expect that these correlations cause the partition functions 
of to be too steep as we have argued above. However, in Fig. 17, we verify that for our 
analyses we do not yet see any effect of this. 
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Figure 17. Investigation of effects from correlations between ring segments {top) and between samples 
of the global analysis {bottom). In the top (bottom) panels, we show 100 (20) realizations of the 
simulated empirical partition function of obtained from the following samples: 
Top left: m = 712 independent samples of n = 143 pixels each. 

Top right: m = 712 ring segments of an opening angle of 20° (n ~ 143), and a gap between rings and 
ring segments of 1°, extracted from simulated CMB maps. 
Bottom left: m = 500 independent samples of n = 2000 pixels each. 

Bottom right: m = 500 global samples of n = 2000 pixels each, randomly drawn from simulated CMB 
maps. 



References 

[1] J. Mather et al.. Calibrator Design for the COBE Far-Infrared Absolute Spectrophotometer 
(FIRAS), Astrophys. J. 512, 511 (1999) [astro-ph/9810373]; 

J. Fixen, The Temperature of the Cosmic Microwave Background^ Astrophys. J. 707, 916 
(2009) [arXiv:0911.1955]. 

[2] A.N. Kolmogorov, Sulla determinazione empirica di una legge di distribuzione^ G. 1st. Ital. 
Attuari 4, 83 (1933); 

V. Arnold, Orbits^ statistics in chaotic dynamical systems^ Nonlinearity 21, T109 (2008). 

[3] V.G.Gurzadyan, A. A. Kochdiiy din ^Kolmogorov stochasticity parameter measuring the 
randomness in Cosmic Microwave Background , Astron. & Astrophys. 492, L33 (2008) 
[arXiv:0810.3289]; 

V.G. Gurzadyan et al., Kolmogorov cosmic microwave background sky^ Astron. & 
Astrophys. 497, 343 (2008) [astro-ph/0811.2732]; 

V.G. Gurzadyan et al.. Plane-mirroring anomaly in the cosmic microwave background 
maps, Astron. & Astrophys. 498, LI (2009) [arXiv:0903.3273]; 



-26- 



V.G. Gurzadyan et al., The power spectrum of the cosmic microwave background 
Kolmogorov maps: possible clue to correlation of voids, Astron. & Astrophys. 506, L37 
(2009) [arXiv:0909.2162]; 

V.G. Gurzadyan et al., A weakly random Universe?, Astron. & Astrophys. 525, L7 (2011) 
[arXiv: 101 1.3711]. 

[4] S.K. Naess, Application of the Kolmogorov-Smirnov test to CMB data: Is the universe 
really weakly random?, arXiv: 1105.5051. 

[5] J. Michaud, Test de Komogorov modifie et application au CMB, Travail de Master 
Bi-disciplinaire (physique et mathematiques) a TUniversite de Geneve (2009). 

[6] H. Lilliefors, On the Kolmogorov-Smirnov Test for Normality with Mean and Variance 
Unknown, Journal of the American Statistical Association 62, 399 (1967). 

[7] D. Larson et al.. Seven- Year Wilkinson Microwave Anisotropy Probe (WMAP) 

Observations: Power Spectra and WMAP-Derived Parameters, Astrophys. J. Suppl. Ser. 
192, 16 (2011) [arXiv:1001.4635]. 

[8] R. Durrer, The Cosmic Microwave Background, Cambridge University Press (2008). 

[9] U. Sawangwit and T. Shanks, Lambda- CDM and the WMAP power spectrum beam profile 
sensitivity, arXiv:1006.1270vl [astro-ph.CO]. 

[10] L. Toffolatti et al., Extragalactic Source Counts and Contributions to the Anisotropics of 
the Cosmic Microwave Background. Predictions for the Planck Surveyor mission, MNRAS 
297, 117 (1998) [astro-ph/9711085]. 

[11] M.R. Nolta et al., Five-Year Wilkinson Microwave Anisotropy Probe (WMAP) 
Observations: Angular Power Spectra, Astrophys. J. Suppl. Ser. 180, 296 (2009) 
[arXiv:0803.0593]. 

[12] K.M. Gorski et al., HEALPix - a Framework for High Resolution Discretization, and Fast 
Analysis of Data Distributed on the Sphere , Astrophys. J. 622, 759 (2005) 
[astro-ph/0409513]. 

[13] M. Doran, CMBEASY:: an Object Oriented Code for the Cosmic Microwave Background, 
JCAP 10, Oil (2005) [astro-ph/0302138], www.cmbeasy.org. 



-27- 



